Dataset statistics
| Number of variables | 32 |
|---|---|
| Number of observations | 40000 |
| Missing cells | 44194 |
| Missing cells (%) | 3.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 9.8 MiB |
| Average record size in memory | 256.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 11 |
| Unsupported | 1 |
| Boolean | 10 |
QUANT_DEPENDANTS has constant value "0" | Constant |
FLAG_OTHER_CARD has constant value "False" | Constant |
QUANT_BANKING_ACCOUNTS has constant value "0" | Constant |
FLAG_MOBILE_PHONE has constant value "False" | Constant |
FLAG_CONTACT_PHONE has constant value "False" | Constant |
COD_APPLICATION_BOOTH has constant value "0" | Constant |
FLAG_CARD_INSURANCE_OPTION has constant value "False" | Constant |
PERSONAL_REFERENCE_#1 has a high cardinality: 17572 distinct values | High cardinality |
PERSONAL_REFERENCE_#2 has a high cardinality: 13672 distinct values | High cardinality |
QUANT_ADDITIONAL_CARDS_IN_THE_APPLICATION is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
FLAG_MOBILE_PHONE is highly correlated with QUANT_ADDITIONAL_CARDS_IN_THE_APPLICATION and 17 other fields | High correlation |
FLAG_RESIDENCIAL_PHONE is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
RESIDENCE_TYPE is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
FLAG_MOTHERS_NAME is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
FLAG_CARD_INSURANCE_OPTION is highly correlated with QUANT_ADDITIONAL_CARDS_IN_THE_APPLICATION and 17 other fields | High correlation |
FLAG_CONTACT_PHONE is highly correlated with QUANT_ADDITIONAL_CARDS_IN_THE_APPLICATION and 17 other fields | High correlation |
QUANT_BANKING_ACCOUNTS is highly correlated with QUANT_ADDITIONAL_CARDS_IN_THE_APPLICATION and 17 other fields | High correlation |
MARITAL_STATUS is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
COD_APPLICATION_BOOTH is highly correlated with QUANT_ADDITIONAL_CARDS_IN_THE_APPLICATION and 17 other fields | High correlation |
FLAG_FATHERS_NAME is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
SEX is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
FLAG_OTHER_CARD is highly correlated with QUANT_ADDITIONAL_CARDS_IN_THE_APPLICATION and 17 other fields | High correlation |
FLAG_RESIDENCE_STATE=WORKING_STATE is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
FLAG_RESIDENCIAL_ADDRESS=POSTAL_ADDRESS is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
SHOP_RANK is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
TARGET_LABEL_BAD=1 is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
QUANT_DEPENDANTS is highly correlated with QUANT_ADDITIONAL_CARDS_IN_THE_APPLICATION and 17 other fields | High correlation |
FLAG_RESIDENCE_TOWN=WORKING_TOWN is highly correlated with FLAG_MOBILE_PHONE and 6 other fields | High correlation |
MARITAL_STATUS is highly correlated with AGE | High correlation |
AGE is highly correlated with MARITAL_STATUS | High correlation |
FLAG_RESIDENCIAL_PHONE is highly correlated with AREA_CODE_RESIDENCIAL_PHONE | High correlation |
AREA_CODE_RESIDENCIAL_PHONE is highly correlated with FLAG_RESIDENCIAL_PHONE | High correlation |
EDUCATION has 40000 (100.0%) missing values | Missing |
PERSONAL_REFERENCE_#2 has 4190 (10.5%) missing values | Missing |
MATE_INCOME is highly skewed (γ1 = 73.04313799) | Skewed |
PERSONAL_NET_INCOME is highly skewed (γ1 = 55.52751098) | Skewed |
ID_CLIENT is uniformly distributed | Uniform |
ID_CLIENT has unique values | Unique |
EDUCATION is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
MONTHS_IN_RESIDENCE has 774 (1.9%) zeros | Zeros |
MONTHS_IN_THE_JOB has 8347 (20.9%) zeros | Zeros |
MATE_INCOME has 38411 (96.0%) zeros | Zeros |
PERSONAL_NET_INCOME has 956 (2.4%) zeros | Zeros |
Reproduction
| Analysis started | 2022-06-22 21:46:09.116427 |
|---|---|
| Analysis finished | 2022-06-22 21:51:11.991337 |
| Duration | 5 minutes and 2.87 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 40000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24982.227 |
| Minimum | 2 |
|---|---|
| Maximum | 50000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 312.6 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2475.9 |
| Q1 | 12458.75 |
| median | 25058.5 |
| Q3 | 37425.25 |
| 95-th percentile | 47518.1 |
| Maximum | 50000 |
| Range | 49998 |
| Interquartile range (IQR) | 24966.5 |
Descriptive statistics
| Standard deviation | 14428.53176 |
|---|---|
| Coefficient of variation (CV) | 0.5775518635 |
| Kurtosis | -1.196545227 |
| Mean | 24982.227 |
| Median Absolute Deviation (MAD) | 12483 |
| Skewness | -0.0005505929594 |
| Sum | 999289080 |
| Variance | 208182528.7 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 33275 | 1 | < 0.1% |
| 33267 | 1 | < 0.1% |
| 33269 | 1 | < 0.1% |
| 33270 | 1 | < 0.1% |
| 33271 | 1 | < 0.1% |
| 33272 | 1 | < 0.1% |
| 33273 | 1 | < 0.1% |
| 33274 | 1 | < 0.1% |
| 33276 | 1 | < 0.1% |
| Other values (39990) | 39990 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 12 | 1 |
| Value | Count | Frequency (%) |
| 50000 | 1 | |
| 49998 | 1 | |
| 49996 | 1 | |
| 49995 | 1 | |
| 49994 | 1 | |
| 49993 | 1 | |
| 49992 | 1 | |
| 49991 | 1 | |
| 49990 | 1 | |
| 49989 | 1 |
ID_SHOP
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.82295 |
| Minimum | 1 |
|---|---|
| Maximum | 96 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 312.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 12 |
| median | 21 |
| Q3 | 24 |
| 95-th percentile | 55 |
| Maximum | 96 |
| Range | 95 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 14.57191346 |
|---|---|
| Coefficient of variation (CV) | 0.6998006266 |
| Kurtosis | 4.02119351 |
| Mean | 20.82295 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.683848981 |
| Sum | 832918 |
| Variance | 212.3406618 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25 | 5356 | 13.4% |
| 22 | 3914 | 9.8% |
| 24 | 3502 | 8.8% |
| 55 | 3409 | 8.5% |
| 23 | 2375 | 5.9% |
| 20 | 1811 | 4.5% |
| 1 | 1616 | 4.0% |
| 12 | 1551 | 3.9% |
| 15 | 1542 | 3.9% |
| 19 | 1384 | 3.5% |
| Other values (21) | 13540 |
| Value | Count | Frequency (%) |
| 1 | 1616 | |
| 2 | 616 | 1.5% |
| 3 | 1356 | |
| 4 | 540 | 1.4% |
| 5 | 503 | 1.3% |
| 6 | 577 | 1.4% |
| 7 | 680 | |
| 8 | 489 | 1.2% |
| 9 | 986 | |
| 10 | 1294 |
| Value | Count | Frequency (%) |
| 96 | 179 | 0.4% |
| 81 | 5 | < 0.1% |
| 77 | 1 | < 0.1% |
| 66 | 371 | 0.9% |
| 55 | 3409 | |
| 50 | 5 | < 0.1% |
| 25 | 5356 | |
| 24 | 3502 | |
| 23 | 2375 | |
| 22 | 3914 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 312.6 KiB |
| F | |
|---|---|
| M |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 39997 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | M |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| F | 27903 | |
| M | 12094 | |
| (Missing) | 3 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| f | 27903 | |
| m | 12094 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 27903 | |
| M | 12094 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 39997 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 27903 | |
| M | 12094 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39997 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 27903 | |
| M | 12094 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39997 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| F | 27903 | |
| M | 12094 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 312.6 KiB |
| S | |
|---|---|
| C | |
| O | |
| V | 1961 |
| D | 1723 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 40000 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S |
|---|---|
| 2nd row | C |
| 3rd row | S |
| 4th row | S |
| 5th row | S |
Common Values
| Value | Count | Frequency (%) |
| S | 20375 | |
| C | 13721 | |
| O | 2220 | 5.5% |
| V | 1961 | 4.9% |
| D | 1723 | 4.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| s | 20375 | |
| c | 13721 | |
| o | 2220 | 5.5% |
| v | 1961 | 4.9% |
| d | 1723 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 20375 | |
| C | 13721 | |
| O | 2220 | 5.5% |
| V | 1961 | 4.9% |
| D | 1723 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 40000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 20375 | |
| C | 13721 | |
| O | 2220 | 5.5% |
| V | 1961 | 4.9% |
| D | 1723 | 4.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 20375 | |
| C | 13721 | |
| O | 2220 | 5.5% |
| V | 1961 | 4.9% |
| D | 1723 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 20375 | |
| C | 13721 | |
| O | 2220 | 5.5% |
| V | 1961 | 4.9% |
| D | 1723 | 4.3% |
| Distinct | 72 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.649725 |
| Minimum | 15 |
|---|---|
| Maximum | 88 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 312.6 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 23 |
| median | 33 |
| Q3 | 43 |
| 95-th percentile | 60 |
| Maximum | 88 |
| Range | 73 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 13.07620003 |
|---|---|
| Coefficient of variation (CV) | 0.3773825052 |
| Kurtosis | -0.01990747593 |
| Mean | 34.649725 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.7616857594 |
| Sum | 1385989 |
| Variance | 170.9870071 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 2022 | 5.1% |
| 19 | 2007 | 5.0% |
| 21 | 1808 | 4.5% |
| 22 | 1567 | 3.9% |
| 18 | 1486 | 3.7% |
| 23 | 1351 | 3.4% |
| 24 | 1244 | 3.1% |
| 25 | 1170 | 2.9% |
| 28 | 1117 | 2.8% |
| 26 | 1073 | 2.7% |
| Other values (62) | 25155 |
| Value | Count | Frequency (%) |
| 15 | 5 | < 0.1% |
| 16 | 12 | < 0.1% |
| 17 | 72 | 0.2% |
| 18 | 1486 | |
| 19 | 2007 | |
| 20 | 2022 | |
| 21 | 1808 | |
| 22 | 1567 | |
| 23 | 1351 | |
| 24 | 1244 |
| Value | Count | Frequency (%) |
| 88 | 2 | < 0.1% |
| 86 | 1 | < 0.1% |
| 84 | 1 | < 0.1% |
| 83 | 6 | |
| 82 | 6 | |
| 81 | 7 | |
| 80 | 7 | |
| 79 | 10 | |
| 78 | 11 | |
| 77 | 13 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 312.6 KiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 40000 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 40000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 40000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 40000 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 32649 | |
| False | 7351 | 18.4% |
| Distinct | 59 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33.812275 |
| Minimum | 1 |
|---|---|
| Maximum | 70 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 312.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 31 |
| median | 31 |
| Q3 | 31 |
| 95-th percentile | 50 |
| Maximum | 70 |
| Range | 69 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 10.4029414 |
|---|---|
| Coefficient of variation (CV) | 0.3076675972 |
| Kurtosis | 1.285351655 |
| Mean | 33.812275 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.2287684343 |
| Sum | 1352491 |
| Variance | 108.2211899 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31 | 28080 | |
| 50 | 8884 | 22.2% |
| 5 | 1951 | 4.9% |
| 23 | 793 | 2.0% |
| 24 | 90 | 0.2% |
| 49 | 37 | 0.1% |
| 32 | 26 | 0.1% |
| 27 | 20 | 0.1% |
| 42 | 12 | < 0.1% |
| 38 | 10 | < 0.1% |
| Other values (49) | 97 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 2 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1951 | |
| 6 | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
| 8 | 3 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 70 | 1 | < 0.1% |
| 69 | 2 | < 0.1% |
| 68 | 6 | |
| 67 | 2 | < 0.1% |
| 65 | 1 | < 0.1% |
| 64 | 1 | < 0.1% |
| 63 | 1 | < 0.1% |
| 62 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 59 | 1 | < 0.1% |
PAYMENT_DAY
Real number (ℝ≥0)
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.31395 |
| Minimum | 1 |
|---|---|
| Maximum | 28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 312.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 9 |
| median | 12 |
| Q3 | 20 |
| 95-th percentile | 28 |
| Maximum | 28 |
| Range | 27 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 7.159756766 |
|---|---|
| Coefficient of variation (CV) | 0.4675316797 |
| Kurtosis | -0.8118152063 |
| Mean | 15.31395 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.1133339367 |
| Sum | 612558 |
| Variance | 51.26211695 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 10150 | |
| 8 | 7206 | |
| 18 | 6638 | |
| 20 | 5050 | |
| 28 | 3604 | 9.0% |
| 25 | 2705 | 6.8% |
| 3 | 1563 | 3.9% |
| 23 | 1157 | 2.9% |
| 1 | 1091 | 2.7% |
| 16 | 213 | 0.5% |
| Other values (6) | 623 | 1.6% |
| Value | Count | Frequency (%) |
| 1 | 1091 | 2.7% |
| 3 | 1563 | 3.9% |
| 6 | 124 | 0.3% |
| 8 | 7206 | |
| 9 | 104 | 0.3% |
| 11 | 213 | 0.5% |
| 12 | 10150 | |
| 15 | 1 | < 0.1% |
| 16 | 213 | 0.5% |
| 18 | 6638 |
| Value | Count | Frequency (%) |
| 28 | 3604 | 9.0% |
| 27 | 54 | 0.1% |
| 25 | 2705 | 6.8% |
| 23 | 1157 | 2.9% |
| 22 | 127 | 0.3% |
| 20 | 5050 | |
| 18 | 6638 | |
| 16 | 213 | 0.5% |
| 15 | 1 | < 0.1% |
| 12 | 10150 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 312.6 KiB |
| 0 | |
|---|---|
| 3 | 178 |
| 2 | 52 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 40000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 39770 | |
| 3 | 178 | 0.4% |
| 2 | 52 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 39770 | |
| 3 | 178 | 0.4% |
| 2 | 52 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 39770 | |
| 3 | 178 | 0.4% |
| 2 | 52 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 39770 | |
| 3 | 178 | 0.4% |
| 2 | 52 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 40000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 39770 | |
| 3 | 178 | 0.4% |
| 2 | 52 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 39770 | |
| 3 | 178 | 0.4% |
| 2 | 52 | 0.1% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 312.6 KiB |
| P | |
|---|---|
| A | |
| C | |
| O | 1647 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 40000 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P |
|---|---|
| 2nd row | P |
| 3rd row | O |
| 4th row | P |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| P | 29752 | |
| A | 5130 | 12.8% |
| C | 3471 | 8.7% |
| O | 1647 | 4.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| p | 29752 | |
| a | 5130 | 12.8% |
| c | 3471 | 8.7% |
| o | 1647 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 29752 | |
| A | 5130 | 12.8% |
| C | 3471 | 8.7% |
| O | 1647 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 40000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 29752 | |
| A | 5130 | 12.8% |
| C | 3471 | 8.7% |
| O | 1647 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 29752 | |
| A | 5130 | 12.8% |
| C | 3471 | 8.7% |
| O | 1647 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 29752 | |
| A | 5130 | 12.8% |
| C | 3471 | 8.7% |
| O | 1647 | 4.1% |
| Distinct | 76 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 152.9211 |
| Minimum | 0 |
|---|---|
| Maximum | 1188 |
| Zeros | 774 |
| Zeros (%) | 1.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 312.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 36 |
| median | 120 |
| Q3 | 240 |
| 95-th percentile | 420 |
| Maximum | 1188 |
| Range | 1188 |
| Interquartile range (IQR) | 204 |
Descriptive statistics
| Standard deviation | 136.0969619 |
|---|---|
| Coefficient of variation (CV) | 0.889981578 |
| Kurtosis | 1.081098854 |
| Mean | 152.9211 |
| Median Absolute Deviation (MAD) | 96 |
| Skewness | 1.092675097 |
| Sum | 6116844 |
| Variance | 18522.38303 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 4933 | 12.3% |
| 24 | 2902 | 7.3% |
| 120 | 2794 | 7.0% |
| 240 | 2768 | 6.9% |
| 36 | 2419 | 6.0% |
| 60 | 1966 | 4.9% |
| 48 | 1758 | 4.4% |
| 72 | 1502 | 3.8% |
| 180 | 1363 | 3.4% |
| 360 | 1316 | 3.3% |
| Other values (66) | 16279 |
| Value | Count | Frequency (%) |
| 0 | 774 | 1.9% |
| 12 | 4933 | |
| 24 | 2902 | |
| 36 | 2419 | |
| 48 | 1758 | 4.4% |
| 60 | 1966 | 4.9% |
| 72 | 1502 | 3.8% |
| 84 | 990 | 2.5% |
| 96 | 1222 | 3.1% |
| 108 | 664 | 1.7% |
| Value | Count | Frequency (%) |
| 1188 | 1 | < 0.1% |
| 1176 | 1 | < 0.1% |
| 1116 | 1 | < 0.1% |
| 1020 | 1 | < 0.1% |
| 900 | 1 | < 0.1% |
| 888 | 2 | |
| 852 | 1 | < 0.1% |
| 840 | 3 | |
| 828 | 2 | |
| 816 | 2 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| True | |
|---|---|
| False | 150 |
| Value | Count | Frequency (%) |
| True | 39850 | |
| False | 150 | 0.4% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| True | |
|---|---|
| False | 1658 |
| Value | Count | Frequency (%) |
| True | 38342 | |
| False | 1658 | 4.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 21740 | |
| True | 18260 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| True | |
|---|---|
| False | 347 |
| Value | Count | Frequency (%) |
| True | 39653 | |
| False | 347 | 0.9% |
| Distinct | 54 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.6295 |
| Minimum | 0 |
|---|---|
| Maximum | 1176 |
| Zeros | 8347 |
| Zeros (%) | 20.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 312.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 12 |
| median | 24 |
| Q3 | 60 |
| 95-th percentile | 228 |
| Maximum | 1176 |
| Range | 1176 |
| Interquartile range (IQR) | 48 |
Descriptive statistics
| Standard deviation | 73.87513904 |
|---|---|
| Coefficient of variation (CV) | 1.459132305 |
| Kurtosis | 8.923171323 |
| Mean | 50.6295 |
| Median Absolute Deviation (MAD) | 24 |
| Skewness | 2.527016428 |
| Sum | 2025180 |
| Variance | 5457.536168 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 11593 | |
| 0 | 8347 | |
| 24 | 4018 | 10.0% |
| 36 | 2899 | 7.2% |
| 60 | 1932 | 4.8% |
| 48 | 1893 | 4.7% |
| 72 | 1393 | 3.5% |
| 120 | 1249 | 3.1% |
| 96 | 777 | 1.9% |
| 84 | 776 | 1.9% |
| Other values (44) | 5123 |
| Value | Count | Frequency (%) |
| 0 | 8347 | |
| 12 | 11593 | |
| 24 | 4018 | 10.0% |
| 36 | 2899 | 7.2% |
| 48 | 1893 | 4.7% |
| 60 | 1932 | 4.8% |
| 72 | 1393 | 3.5% |
| 84 | 776 | 1.9% |
| 96 | 777 | 1.9% |
| 108 | 405 | 1.0% |
| Value | Count | Frequency (%) |
| 1176 | 1 | < 0.1% |
| 1104 | 1 | < 0.1% |
| 780 | 1 | < 0.1% |
| 708 | 1 | < 0.1% |
| 684 | 2 | |
| 660 | 1 | < 0.1% |
| 612 | 1 | < 0.1% |
| 600 | 3 | |
| 588 | 1 | < 0.1% |
| 552 | 1 | < 0.1% |
PROFESSION_CODE
Real number (ℝ≥0)
| Distinct | 291 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 484.611875 |
| Minimum | 0 |
|---|---|
| Maximum | 999 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 312.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 13 |
| Q1 | 88 |
| median | 514 |
| Q3 | 865 |
| 95-th percentile | 999 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 777 |
Descriptive statistics
| Standard deviation | 382.1023719 |
|---|---|
| Coefficient of variation (CV) | 0.7884709221 |
| Kurtosis | -1.661763842 |
| Mean | 484.611875 |
| Median Absolute Deviation (MAD) | 408 |
| Skewness | 0.07352655483 |
| Sum | 19384475 |
| Variance | 146002.2226 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 999 | 5084 | 12.7% |
| 950 | 3736 | 9.3% |
| 13 | 2002 | 5.0% |
| 205 | 1786 | 4.5% |
| 703 | 1547 | 3.9% |
| 26 | 1518 | 3.8% |
| 131 | 1046 | 2.6% |
| 514 | 996 | 2.5% |
| 60 | 959 | 2.4% |
| 40 | 774 | 1.9% |
| Other values (281) | 20552 |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1 | 48 | 0.1% |
| 2 | 110 | |
| 3 | 5 | < 0.1% |
| 4 | 209 | |
| 5 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 10 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 999 | 5084 | |
| 992 | 238 | 0.6% |
| 991 | 63 | 0.2% |
| 990 | 1 | < 0.1% |
| 954 | 18 | < 0.1% |
| 953 | 223 | 0.6% |
| 952 | 155 | 0.4% |
| 951 | 6 | < 0.1% |
| 950 | 3736 | |
| 924 | 27 | 0.1% |
| Distinct | 561 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.99333325 |
| Minimum | 0 |
|---|---|
| Maximum | 70000 |
| Zeros | 38411 |
| Zeros (%) | 96.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 312.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 70000 |
| Range | 70000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 507.1591245 |
|---|---|
| Coefficient of variation (CV) | 9.945596653 |
| Kurtosis | 9259.924185 |
| Mean | 50.99333325 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 73.04313799 |
| Sum | 2039733.33 |
| Variance | 257210.3776 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38411 | |
| 1000 | 70 | 0.2% |
| 500 | 69 | 0.2% |
| 600 | 65 | 0.2% |
| 800 | 64 | 0.2% |
| 1500 | 57 | 0.1% |
| 2000 | 54 | 0.1% |
| 1200 | 52 | 0.1% |
| 400 | 52 | 0.1% |
| 700 | 44 | 0.1% |
| Other values (551) | 1062 | 2.7% |
| Value | Count | Frequency (%) |
| 0 | 38411 | |
| 1 | 1 | < 0.1% |
| 100 | 1 | < 0.1% |
| 150 | 1 | < 0.1% |
| 155 | 1 | < 0.1% |
| 180 | 24 | 0.1% |
| 190 | 1 | < 0.1% |
| 196 | 2 | < 0.1% |
| 200 | 13 | < 0.1% |
| 201 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 70000 | 1 | < 0.1% |
| 22400 | 1 | < 0.1% |
| 21540 | 1 | < 0.1% |
| 11476 | 1 | < 0.1% |
| 10000 | 2 | |
| 8513 | 1 | < 0.1% |
| 8000 | 2 | |
| 7800 | 1 | < 0.1% |
| 7000 | 3 | |
| 6772 | 1 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| True | |
|---|---|
| False | 842 |
| Value | Count | Frequency (%) |
| True | 39158 | |
| False | 842 | 2.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 40000 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 312.6 KiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 40000 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 40000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 40000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 40000 |
| Distinct | 17572 |
|---|---|
| Distinct (%) | 43.9% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 312.6 KiB |
| MARIA | 552 |
|---|---|
| MARCIA | 269 |
| FATIMA | 254 |
| SONIA | 253 |
| SANDRA | 211 |
| Other values (17567) |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 10.03165079 |
| Min length | 1 |
Characters and Unicode
| Total characters | 401256 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 15474 ? |
|---|---|
| Unique (%) | 38.7% |
Sample
| 1st row | SARA |
|---|---|
| 2nd row | JACI |
| 3rd row | MARCIA CRISTINA ZANELLA |
| 4th row | MARCIO |
| 5th row | FABIO (NOIVO) |
Common Values
| Value | Count | Frequency (%) |
| MARIA | 552 | 1.4% |
| MARCIA | 269 | 0.7% |
| FATIMA | 254 | 0.6% |
| SONIA | 253 | 0.6% |
| SANDRA | 211 | 0.5% |
| MARIA JOSE | 198 | 0.5% |
| LUCIA | 197 | 0.5% |
| CLAUDIA | 190 | 0.5% |
| VERA | 188 | 0.5% |
| REGINA | 182 | 0.5% |
| Other values (17562) | 37505 |
Length
| Value | Count | Frequency (%) |
| maria | 3338 | 5.1% |
| 1265 | 1.9% | |
| de | 997 | 1.5% |
| silva | 978 | 1.5% |
| da | 870 | 1.3% |
| ana | 787 | 1.2% |
| jose | 715 | 1.1% |
| lucia | 687 | 1.1% |
| santos | 484 | 0.7% |
| fatima | 479 | 0.7% |
| Other values (8843) | 54276 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 70573 | |
| I | 42535 | |
| E | 37729 | |
| R | 31109 | 7.8% |
| 25651 | 6.4% | |
| N | 25059 | 6.2% |
| L | 24333 | 6.1% |
| O | 22522 | 5.6% |
| S | 18852 | 4.7% |
| M | 14743 | 3.7% |
| Other values (54) | 88150 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 370879 | |
| Space Separator | 25651 | 6.4% |
| Open Punctuation | 1247 | 0.3% |
| Close Punctuation | 1116 | 0.3% |
| Other Punctuation | 967 | 0.2% |
| Dash Punctuation | 864 | 0.2% |
| Decimal Number | 466 | 0.1% |
| Math Symbol | 59 | < 0.1% |
| Modifier Symbol | 6 | < 0.1% |
| Currency Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 70573 | |
| I | 42535 | |
| E | 37729 | |
| R | 31109 | |
| N | 25059 | 6.8% |
| L | 24333 | 6.6% |
| O | 22522 | 6.1% |
| S | 18852 | 5.1% |
| M | 14743 | 4.0% |
| D | 14630 | 3.9% |
| Other values (26) | 68794 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 85 | |
| 1 | 58 | |
| 6 | 53 | |
| 0 | 48 | |
| 7 | 42 | |
| 5 | 38 | |
| 9 | 37 | |
| 4 | 36 | |
| 3 | 35 | |
| 8 | 34 | 7.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 722 | |
| * | 131 | 13.5% |
| . | 99 | 10.2% |
| , | 8 | 0.8% |
| \ | 3 | 0.3% |
| ' | 3 | 0.3% |
| & | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1246 | |
| [ | 1 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1115 | |
| ] | 1 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 58 | |
| > | 1 | 1.7% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 5 | |
| ´ | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| 25651 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 864 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 370879 | |
| Common | 30377 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 70573 | |
| I | 42535 | |
| E | 37729 | |
| R | 31109 | |
| N | 25059 | 6.8% |
| L | 24333 | 6.6% |
| O | 22522 | 6.1% |
| S | 18852 | 5.1% |
| M | 14743 | 4.0% |
| D | 14630 | 3.9% |
| Other values (26) | 68794 |
Common
| Value | Count | Frequency (%) |
| 25651 | ||
| ( | 1246 | 4.1% |
| ) | 1115 | 3.7% |
| - | 864 | 2.8% |
| / | 722 | 2.4% |
| * | 131 | 0.4% |
| . | 99 | 0.3% |
| 2 | 85 | 0.3% |
| = | 58 | 0.2% |
| 1 | 58 | 0.2% |
| Other values (18) | 348 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 401057 | |
| None | 199 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 70573 | |
| I | 42535 | |
| E | 37729 | |
| R | 31109 | 7.8% |
| 25651 | 6.4% | |
| N | 25059 | 6.2% |
| L | 24333 | 6.1% |
| O | 22522 | 5.6% |
| S | 18852 | 4.7% |
| M | 14743 | 3.7% |
| Other values (43) | 87951 |
None
| Value | Count | Frequency (%) |
| Ç | 127 | |
| Ã | 35 | 17.6% |
| Á | 11 | 5.5% |
| Í | 7 | 3.5% |
| É | 6 | 3.0% |
| Â | 3 | 1.5% |
| À | 3 | 1.5% |
| Ú | 3 | 1.5% |
| Ô | 2 | 1.0% |
| ´ | 1 | 0.5% |
| Distinct | 13672 |
|---|---|
| Distinct (%) | 38.2% |
| Missing | 4190 |
| Missing (%) | 10.5% |
| Memory size | 312.6 KiB |
| MARIA | 550 |
|---|---|
| MARCIA | 274 |
| SONIA | 266 |
| FATIMA | 259 |
| SANDRA | 231 |
| Other values (13667) |
Length
| Max length | 32 |
|---|---|
| Median length | 23 |
| Mean length | 8.937084613 |
| Min length | 1 |
Characters and Unicode
| Total characters | 320037 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 11673 ? |
|---|---|
| Unique (%) | 32.6% |
Sample
| 1st row | FELIPE |
|---|---|
| 2nd row | VALERIA ALEXANDRA TRAJANO |
| 3rd row | SANDRO L P MARTINS |
| 4th row | ANA |
| 5th row | EDU (AVO) |
Common Values
| Value | Count | Frequency (%) |
| MARIA | 550 | 1.4% |
| MARCIA | 274 | 0.7% |
| SONIA | 266 | 0.7% |
| FATIMA | 259 | 0.6% |
| SANDRA | 231 | 0.6% |
| LUCIA | 228 | 0.6% |
| ANA | 223 | 0.6% |
| VERA | 210 | 0.5% |
| CLAUDIA | 193 | 0.5% |
| REGINA | 189 | 0.5% |
| Other values (13662) | 33187 | |
| (Missing) | 4190 | 10.5% |
Length
| Value | Count | Frequency (%) |
| maria | 2369 | 4.5% |
| 1139 | 2.2% | |
| ana | 724 | 1.4% |
| de | 569 | 1.1% |
| jose | 559 | 1.1% |
| lucia | 550 | 1.1% |
| silva | 513 | 1.0% |
| da | 511 | 1.0% |
| fatima | 403 | 0.8% |
| marcia | 392 | 0.8% |
| Other values (7840) | 44337 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 57006 | |
| I | 34826 | |
| E | 30120 | |
| R | 24664 | 7.7% |
| N | 20985 | 6.6% |
| L | 20063 | 6.3% |
| O | 17898 | 5.6% |
| 16957 | 5.3% | |
| S | 14186 | 4.4% |
| M | 11750 | 3.7% |
| Other values (54) | 71582 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 298880 | |
| Space Separator | 16957 | 5.3% |
| Open Punctuation | 1164 | 0.4% |
| Close Punctuation | 1088 | 0.3% |
| Dash Punctuation | 818 | 0.3% |
| Other Punctuation | 676 | 0.2% |
| Decimal Number | 396 | 0.1% |
| Math Symbol | 57 | < 0.1% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 57006 | |
| I | 34826 | |
| E | 30120 | |
| R | 24664 | |
| N | 20985 | 7.0% |
| L | 20063 | 6.7% |
| O | 17898 | 6.0% |
| S | 14186 | 4.7% |
| M | 11750 | 3.9% |
| D | 11603 | 3.9% |
| Other values (27) | 55779 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 67 | |
| 2 | 58 | |
| 6 | 42 | |
| 1 | 41 | |
| 7 | 36 | |
| 0 | 36 | |
| 4 | 34 | |
| 8 | 32 | |
| 9 | 25 | 6.3% |
| 5 | 25 | 6.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 509 | |
| . | 87 | 12.9% |
| * | 60 | 8.9% |
| \ | 8 | 1.2% |
| ' | 6 | 0.9% |
| , | 3 | 0.4% |
| ; | 2 | 0.3% |
| & | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1162 | |
| [ | 2 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1087 | |
| ] | 1 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 53 | |
| + | 4 | 7.0% |
Space Separator
| Value | Count | Frequency (%) |
| 16957 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 818 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 298880 | |
| Common | 21157 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 57006 | |
| I | 34826 | |
| E | 30120 | |
| R | 24664 | |
| N | 20985 | 7.0% |
| L | 20063 | 6.7% |
| O | 17898 | 6.0% |
| S | 14186 | 4.7% |
| M | 11750 | 3.9% |
| D | 11603 | 3.9% |
| Other values (27) | 55779 |
Common
| Value | Count | Frequency (%) |
| 16957 | ||
| ( | 1162 | 5.5% |
| ) | 1087 | 5.1% |
| - | 818 | 3.9% |
| / | 509 | 2.4% |
| . | 87 | 0.4% |
| 3 | 67 | 0.3% |
| * | 60 | 0.3% |
| 2 | 58 | 0.3% |
| = | 53 | 0.3% |
| Other values (17) | 299 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319838 | |
| None | 199 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 57006 | |
| I | 34826 | |
| E | 30120 | |
| R | 24664 | 7.7% |
| N | 20985 | 6.6% |
| L | 20063 | 6.3% |
| O | 17898 | 5.6% |
| 16957 | 5.3% | |
| S | 14186 | 4.4% |
| M | 11750 | 3.7% |
| Other values (43) | 71383 |
None
| Value | Count | Frequency (%) |
| Ç | 125 | |
| Ã | 27 | 13.6% |
| É | 19 | 9.5% |
| Á | 10 | 5.0% |
| Ú | 7 | 3.5% |
| Í | 6 | 3.0% |
| À | 1 | 0.5% |
| Ê | 1 | 0.5% |
| Ì | 1 | 0.5% |
| Ô | 1 | 0.5% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 40000 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 40000 |
| Distinct | 2315 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9752.711101 |
| Minimum | 0 |
|---|---|
| Maximum | 38529098 |
| Zeros | 956 |
| Zeros (%) | 2.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 312.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 180 |
| Q1 | 270 |
| median | 400 |
| Q3 | 738 |
| 95-th percentile | 1739 |
| Maximum | 38529098 |
| Range | 38529098 |
| Interquartile range (IQR) | 468 |
Descriptive statistics
| Standard deviation | 485633.5056 |
|---|---|
| Coefficient of variation (CV) | 49.79471867 |
| Kurtosis | 3211.137561 |
| Mean | 9752.711101 |
| Median Absolute Deviation (MAD) | 177 |
| Skewness | 55.52751098 |
| Sum | 390108444.1 |
| Variance | 2.358399017 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 300 | 2007 | 5.0% |
| 500 | 1609 | 4.0% |
| 400 | 1589 | 4.0% |
| 180 | 1457 | 3.6% |
| 600 | 1227 | 3.1% |
| 350 | 1024 | 2.6% |
| 200 | 987 | 2.5% |
| 800 | 966 | 2.4% |
| 0 | 956 | 2.4% |
| 220 | 949 | 2.4% |
| Other values (2305) | 27229 |
| Value | Count | Frequency (%) |
| 0 | 956 | |
| 1 | 31 | 0.1% |
| 3 | 3 | < 0.1% |
| 6 | 1 | < 0.1% |
| 10 | 18 | < 0.1% |
| 20 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 22 | 2 | < 0.1% |
| 25 | 2 | < 0.1% |
| 33 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 38529098 | 1 | |
| 28660000 | 1 | |
| 26769527 | 1 | |
| 25900000 | 1 | |
| 25802236 | 1 | |
| 25570382 | 1 | |
| 25121958 | 1 | |
| 25088724 | 1 | |
| 25075228 | 1 | |
| 24673601 | 1 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 312.6 KiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 40000 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 40000 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 40000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 40000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 40000 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 312.6 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | 730 |
| 3 | 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 40000 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 34779 | |
| 1 | 4488 | 11.2% |
| 2 | 730 | 1.8% |
| 3 | 3 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 34779 | |
| 1 | 4488 | 11.2% |
| 2 | 730 | 1.8% |
| 3 | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 34779 | |
| 1 | 4488 | 11.2% |
| 2 | 730 | 1.8% |
| 3 | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 34779 | |
| 1 | 4488 | 11.2% |
| 2 | 730 | 1.8% |
| 3 | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 40000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 34779 | |
| 1 | 4488 | 11.2% |
| 2 | 730 | 1.8% |
| 3 | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 34779 | |
| 1 | 4488 | 11.2% |
| 2 | 730 | 1.8% |
| 3 | 3 | < 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.2 KiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 40000 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 312.6 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 40000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 32100 | |
| 1 | 7900 | 19.8% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 32100 | |
| 1 | 7900 | 19.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 32100 | |
| 1 | 7900 | 19.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 32100 | |
| 1 | 7900 | 19.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 40000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 32100 | |
| 1 | 7900 | 19.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 32100 | |
| 1 | 7900 | 19.8% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| ID_CLIENT | ID_SHOP | SEX | MARITAL_STATUS | AGE | QUANT_DEPENDANTS | EDUCATION | FLAG_RESIDENCIAL_PHONE | AREA_CODE_RESIDENCIAL_PHONE | PAYMENT_DAY | SHOP_RANK | RESIDENCE_TYPE | MONTHS_IN_RESIDENCE | FLAG_MOTHERS_NAME | FLAG_FATHERS_NAME | FLAG_RESIDENCE_TOWN=WORKING_TOWN | FLAG_RESIDENCE_STATE=WORKING_STATE | MONTHS_IN_THE_JOB | PROFESSION_CODE | MATE_INCOME | FLAG_RESIDENCIAL_ADDRESS=POSTAL_ADDRESS | FLAG_OTHER_CARD | QUANT_BANKING_ACCOUNTS | PERSONAL_REFERENCE_#1 | PERSONAL_REFERENCE_#2 | FLAG_MOBILE_PHONE | FLAG_CONTACT_PHONE | PERSONAL_NET_INCOME | COD_APPLICATION_BOOTH | QUANT_ADDITIONAL_CARDS_IN_THE_APPLICATION | FLAG_CARD_INSURANCE_OPTION | TARGET_LABEL_BAD=1 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2 | 15 | F | S | 18 | 0 | NaN | Y | 31 | 20 | 0 | P | 216 | Y | Y | Y | Y | 12 | 853 | 0.0 | Y | N | 0 | SARA | FELIPE | N | N | 300.0 | 0 | 0 | N | 0 |
| 1 | 4 | 12 | F | C | 47 | 0 | NaN | N | 31 | 25 | 0 | P | 180 | Y | Y | N | Y | 24 | 35 | 0.0 | Y | N | 0 | JACI | VALERIA ALEXANDRA TRAJANO | N | N | 304.0 | 0 | 0 | N | 0 |
| 2 | 5 | 16 | F | S | 28 | 0 | NaN | Y | 31 | 25 | 0 | O | 12 | Y | Y | Y | Y | 12 | 24 | 0.0 | Y | N | 0 | MARCIA CRISTINA ZANELLA | SANDRO L P MARTINS | N | N | 250.0 | 0 | 0 | N | 0 |
| 3 | 6 | 24 | M | S | 26 | 0 | NaN | N | 31 | 28 | 0 | P | 180 | Y | Y | N | Y | 0 | 999 | 0.0 | Y | N | 0 | MARCIO | ANA | N | N | 800.0 | 0 | 0 | N | 0 |
| 4 | 7 | 55 | F | S | 22 | 0 | NaN | Y | 31 | 12 | 0 | A | 0 | Y | Y | Y | Y | 48 | 999 | 0.0 | Y | N | 0 | FABIO (NOIVO) | EDU (AVO) | N | N | 410.0 | 0 | 0 | N | 0 |
| 5 | 8 | 6 | F | C | 21 | 0 | NaN | Y | 23 | 28 | 0 | A | 24 | Y | Y | Y | Y | 12 | 40 | 800.0 | Y | N | 0 | OLIONA MARIA CAMPOS | ELIZETE CAMPS COELHO | N | N | 248.0 | 0 | 0 | N | 0 |
| 6 | 9 | 3 | F | S | 27 | 0 | NaN | Y | 31 | 20 | 0 | A | 0 | Y | Y | Y | Y | 0 | 950 | 0.0 | Y | N | 0 | SUELI | REGINA | N | N | 1000.0 | 0 | 0 | N | 1 |
| 7 | 10 | 23 | F | C | 57 | 0 | NaN | Y | 31 | 12 | 0 | P | 24 | Y | Y | N | Y | 96 | 13 | 0.0 | Y | N | 0 | MARIA DE LOURDES | ZILDA | N | N | 856.0 | 0 | 0 | N | 0 |
| 8 | 11 | 25 | F | S | 53 | 0 | NaN | Y | 31 | 18 | 0 | P | 60 | Y | Y | N | Y | 24 | 13 | 0.0 | Y | N | 0 | ANA | MARIA MONICA | N | N | 738.0 | 0 | 1 | N | 1 |
| 9 | 12 | 12 | F | C | 32 | 0 | NaN | Y | 31 | 12 | 0 | P | 24 | Y | Y | N | Y | 0 | 165 | 0.0 | Y | N | 0 | ESTELLA OSVALDO CRUZ | ANA MARIA | N | N | 700.0 | 0 | 0 | N | 0 |
Last rows
| ID_CLIENT | ID_SHOP | SEX | MARITAL_STATUS | AGE | QUANT_DEPENDANTS | EDUCATION | FLAG_RESIDENCIAL_PHONE | AREA_CODE_RESIDENCIAL_PHONE | PAYMENT_DAY | SHOP_RANK | RESIDENCE_TYPE | MONTHS_IN_RESIDENCE | FLAG_MOTHERS_NAME | FLAG_FATHERS_NAME | FLAG_RESIDENCE_TOWN=WORKING_TOWN | FLAG_RESIDENCE_STATE=WORKING_STATE | MONTHS_IN_THE_JOB | PROFESSION_CODE | MATE_INCOME | FLAG_RESIDENCIAL_ADDRESS=POSTAL_ADDRESS | FLAG_OTHER_CARD | QUANT_BANKING_ACCOUNTS | PERSONAL_REFERENCE_#1 | PERSONAL_REFERENCE_#2 | FLAG_MOBILE_PHONE | FLAG_CONTACT_PHONE | PERSONAL_NET_INCOME | COD_APPLICATION_BOOTH | QUANT_ADDITIONAL_CARDS_IN_THE_APPLICATION | FLAG_CARD_INSURANCE_OPTION | TARGET_LABEL_BAD=1 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 39990 | 49989 | 19 | M | S | 28 | 0 | NaN | Y | 31 | 18 | 0 | P | 336 | Y | Y | N | Y | 12 | 514 | 0.0 | Y | N | 0 | SHEILA JUSSARA M CASTELAN | SHIRLEY | N | N | 691.0 | 0 | 0 | N | 0 |
| 39991 | 49990 | 24 | M | C | 58 | 0 | NaN | Y | 50 | 20 | 0 | P | 180 | Y | Y | N | Y | 72 | 999 | 0.0 | Y | N | 0 | SILVANIA / ROSE | AROLDO | N | N | 900.0 | 0 | 1 | N | 0 |
| 39992 | 49991 | 15 | M | C | 43 | 0 | NaN | Y | 31 | 28 | 0 | P | 108 | Y | Y | N | Y | 144 | 921 | 0.0 | Y | N | 0 | CECILIA | MARISA | N | N | 3500.0 | 0 | 1 | N | 1 |
| 39993 | 49992 | 25 | F | S | 23 | 0 | NaN | Y | 31 | 20 | 0 | A | 180 | Y | Y | Y | Y | 24 | 801 | 0.0 | Y | N | 0 | FABIANA | CLAUDIA | N | N | 362.0 | 0 | 0 | N | 0 |
| 39994 | 49993 | 18 | F | C | 38 | 0 | NaN | Y | 23 | 18 | 3 | P | 192 | Y | Y | N | Y | 0 | 999 | 0.0 | Y | N | 0 | GEOVANE S. RIBEIRO | SIDNÉIA MOZER | N | N | 0.0 | 0 | 0 | N | 0 |
| 39995 | 49994 | 1 | M | C | 29 | 0 | NaN | Y | 31 | 12 | 0 | A | 36 | Y | Y | N | Y | 24 | 305 | 0.0 | Y | N | 0 | RUTH | NaN | N | N | 796.0 | 0 | 1 | N | 1 |
| 39996 | 49995 | 12 | F | S | 20 | 0 | NaN | Y | 31 | 20 | 0 | P | 180 | Y | Y | Y | Y | 12 | 712 | 0.0 | Y | N | 0 | DALVA DE AZEVEDO | ANA | N | N | 200.0 | 0 | 0 | N | 0 |
| 39997 | 49996 | 19 | M | S | 21 | 0 | NaN | Y | 31 | 12 | 0 | P | 120 | Y | Y | Y | Y | 12 | 218 | 0.0 | Y | N | 0 | ALBA | DENILSON | N | N | 234.0 | 0 | 0 | N | 0 |
| 39998 | 49998 | 23 | F | S | 23 | 0 | NaN | Y | 31 | 28 | 0 | P | 264 | Y | Y | Y | Y | 12 | 991 | 0.0 | Y | N | 0 | NOVINA | GLAUCIA | N | N | 240.0 | 0 | 0 | N | 1 |
| 39999 | 50000 | 22 | M | S | 29 | 0 | NaN | Y | 31 | 23 | 0 | P | 48 | Y | Y | N | Y | 36 | 26 | 0.0 | Y | N | 0 | TITO MARTINS | NaN | N | N | 341.0 | 0 | 0 | N | 0 |